DESCRIPTION

METHOD OF DEFINING THE DIFFERENTIATION GRADE OF TUMOR 

Technical Field 

The present invention relates to a method of defining 
the differentiation grade of tumor. More particularly, the 
present invention relates to a method of defining the 
differentiation grade of tumor by selecting genes and/or 
proteins whose expression level correlates with each 
differentiation grade of hepatocellular carcinoma (HCC) and
measuring the expression of the genes and/or proteins of human
tumor tissues in each differentiation grade. The present
invention also relates to the use of these genes and/or proteins 
for diagnosing the differentiation grade of HCC and for 
screening anti-cancer agents for HCC treatment. 

The present invention also relates to a kit for 
performing the method of the present invention comprising DNA 
chips, oligonucleotide chips, protein chips, peptides, 
antibodies, probes and primers that are necessary for DNA 
microarrays, oligonucleotide microarrays, protein arrays, 
northern blotting, in situ hybridization, RNase protection 
assays, western blotting, ELISA assays, reverse transcription 
polymerase-chain reaction (hereinafter referred to as RT-PCR) 
to examine the expression of the genes and/or proteins whose 
expression level correlates with the differentiation grade of 
tumor.

Background Art 

Cancer is a major cause of death worldwide.
Particularly, hepatocellular carcinoma (HCC) is one of the most
common cancers worldwide and represents a major
international health problem because of its increasing
incidence in many countries (Schafer, D.F. and Sorrell, M.F.
Hepatocellular carcinoma, Lancet 353, 1253-1257 (1999),
Colombo, M. Hepatitis C virus and hepatocellular carcinoma,
Semin. Liver Dis. 19, 263-269 (1999), and Okuda, K.
Hepatocellular carcinoma, J. Hepatol. 32, 225-237 (2000)).
Chronic hepatitis C virus (HCV) infection is one of the major
risk factors for HCC, as are hepatitis B virus (HBV)
infection, alcohol consumption, and several carcinogens such
as aflatoxin B1 (Okuda, K. Hepatocellular carcinoma, J. Hepatol.
32, 225-237 (2000)). Several therapies have been adopted for
the treatment of HCC. Those include surgical resection, 
radiotherapy, chemotherapy, and biological therapy including 
hormonal and gene therapy. However, none of these therapies 
could cure the disease. One of the major problems of HCC 
treatment is that characteristics of cancer cells change during 
the development and progression of the disease. Particularly, 
changes in the differentiation grade of tumor cells are 
apparent and frequent. Such changes alter the ability of tumor 
cells to invade and metastasize and also the sensitivity of 
cancer cells to different therapies, causing resistance to 
anti-cancer agents. If the changes in the characteristics of 
cancer cells are precisely diagnosed and managed, cancer 
therapy will be more effective. 

Previous studies suggested the involvement of tumor
suppressor genes and oncogenes such as p53, β-catenin, and
AXIN1 genes in hepatocarcinogenesis (Okabe, H., Satoh, S., Kato,
T., Kitahara, O., Yanagawa, R., Yamaoka, Y., Tsunoda, T.,
Furukawa, Y., and Nakamura, Y. Genome-wide analysis of gene
expression in human hepatocellular carcinomas using cDNA
microarray: identification of genes involved in viral
carcinogenesis and tumor progression. Cancer Res. 61,
2129-2137 (2001)). It has also been suggested that the
development of HCV-associated HCC can be characterized by the
pathological evolution from early to advanced tumor, which
correlates with dedifferentiation of cancer cells (Kojiro, M.
Pathological evolution of early hepatocellular carcinoma.
Oncology 62, 43-47 (2002)). Particularly after introduction
of DNA microarray technologies into medical science (Schena,
M., Shalon, D., Davis, R.W., and Brown, P.O. Quantitative
monitoring of gene expression patterns with a complementary DNA
microarray. Science 270, 467-470 (1995), DeRisi, J., Penland,
L., Brown, P.O., Bittner, M.L., Meltzer, P.S., Ray, M., Chen,
Y., Su, Y.A., and Trent, J.M. Use of a cDNA microarray to analyse
gene expression patterns in human cancer, Nat. Genet. 14,
457-460 (1996)), many studies showed gene-expression patterns
relating to some aspects of HCC (Lau, W.Y., Lai, P.B., Leung,
M.F., Leung, B.C., Wong, N., Chen, G., Leung, T.W., and Liew,
C.T. Differential gene expression of hepatocellular carcinoma
using cDNA microarray analysis, Oncol. Res. 12, 59-69 (2000),
Tackels-Horne, D., Goodman, M.D., Williams, A.J., Wilson, D.J.,
Eskandari, T., Vogt, L.M., Roland, J.F., Scherf, U., and
Vockley, J.G. Identification of differentially expressed genes
in hepatocellular carcinoma and metastatic liver tumors by
oligonucleotide expression profiling. Cancer 92, 395-405
(2001), Xu, L., Hui, L., Wang, S., Gong, J., Jin, Y., Wang, Y.,
Ji, Y., Wu, X., Han, Z., and Hu, G. Expression profiling
suggested a regulatory role of liver-enriched transcription
factors in human hepatocellular carcinoma. Cancer Res. 61,
3176-3681 (2001), Xu, X.R., Huang, J., Xu, Z.G., Qian, B.Z.,
Zhu, Z.D., Yan, Q., Cai, T., Zhang, X., Xiao, H.S., Qu, J., Liu,
F., Huang, Q.H., Cheng, Z.H., Li, N.G., Du, J.J., Hu, W., Shen,
K.T., Lu, G., Fu, G., Zhong, M., Xu, S.H., Gu, W.Y., Huang, W.,
Zhao, X.T., Hu, G.X., Gu, J.R., Chen, Z., and Han, Z.G. Insight
into hepatocellular carcinogenesis at transcriptome level by
comparing gene expression profiles of hepatocellular carcinoma
with those of corresponding non-cancerous liver. Proc. Natl.
Acad. Sci. U.S.A. 98, 15089-15094 (2001), Okabe, H., Satoh, S.,
Kato, T., Kitahara, O., Yanagawa, R., Yamaoka, Y., Tsunoda, T.,
Furukawa, Y., and Nakamura, Y. Genome-wide analysis of gene
expression in human hepatocellular carcinomas using cDNA
microarray: identification of genes involved in viral
carcinogenesis and tumor progression. Cancer Res. 61,
2129-2137 (2001), Shirota, Y., Kaneko, S., Honda, M., Kawai,
H.F., and Kobayashi, K. Identification of differentially
expressed genes in hepatocellular carcinoma with cDNA
microarrays, Hepatology 33, 832-840 (2001), Delpuech, O.,
Trabut, J.B., Carnot, F., Feuillard, J., Brechot, C., and
Kremsdorf, D. Identification, using cDNA macroarray analysis,
of distinct gene expression profiles associated with
pathological and virological features of hepatocellular
carcinoma. Oncogene 21, 2926-2937 (2002), Iizuka, N., Oka, M.,
Yamada-Okabe, H., Mori, N., Tamesa, T., Okada, T., Takemoto,
T., Tangoku, A., Hamada, K., Nakayama, H., Miyamoto, T.,
Uchimura, S., and Hamamoto, Y. Comparison of gene expression
profiles between hepatitis B virus- and hepatitis C
virus-infected hepatocellular carcinoma by oligonucleotide
microarray data based on a supervised learning method. Cancer
Res. 62, 3939-3944 (2002), and Midorikawa, Y., Tsutsumi, S.,
Taniguchi, H., Ishii, M., Kobune, Y., Kodama, T., Makuuchi, M.,
and Aburatani, H. Identification of genes associated with
dedifferentiation of hepatocellular carcinoma with expression
profiling analysis, Jpn. J. Cancer Res. 93, 636-643 (2002)).
Among them, two studies profiled gene expression of HCC in
relation to its development (Okabe, H., Satoh, S., Kato, T.,
Kitahara, O., Yanagawa, R., Yamaoka, Y., Tsunoda, T., Furukawa,
Y., and Nakamura, Y. Genome-wide analysis of gene expression
in human hepatocellular carcinomas using cDNA microarray:
identification of genes involved in viral carcinogenesis and
tumor progression. Cancer Res. 61, 2129-2137 (2001) and
Midorikawa, Y., Tsutsumi, S., Taniguchi, H., Ishii, M., Kobune,
Y., Kodama, T., Makuuchi, M., and Aburatani, H. Identification
of genes associated with dedifferentiation of hepatocellular
carcinoma with expression profiling analysis, Jpn. J. Cancer
Res. 93, 636-643 (2002)). However, nothing is known about genes
and/or proteins that characterize and/or regulate each
differentiation grade of HCC during the course of oncogenesis
and development of HCV-associated HCC. Genes and/or proteins
that regulate the differentiation grade of HCC can be used for
diagnosing the differentiation grade of HCC and for screening
anti-cancer agents for the treatment of HCC arising from
chronic HCV infection.

In the present invention, the inventors describe a 
method of diagnosing the differentiation grade of tumor and 
screening anti-cancer agents for the treatment thereof. 
Particularly, the inventors describe a method of identifying 
40 or more genes and/or proteins whose expression correlates 
with the differentiation grade of HCC, and use of these genes 
and/or proteins for diagnosing the differentiation grade of HCC
and for screening anti-cancer agents for the treatment of HCC
in different grades. More particularly, the inventors describe
a method of predicting non-cancerous liver, pre-cancerous 
liver, and each differentiation grade of HCC with 40 genes 
and/or proteins. 

Disclosure of the Invention 
Summary of the Invention 

Hepatocellular carcinoma (HCC) is one of the most common 
cancers worldwide. However, there is no therapy that can cure 
the disease. This is presumably due to sequential changes in 
characteristics of cancer cells during the development and 
progression of the disease. Particularly, progression of 
cancer is often associated with the changes of differentiation 
grade of tumor cells. Diagnosis and management of such changes
of cancer cells will make cancer therapy more effective. In 
the present invention, genes whose expression correlates with 
oncogenesis and development of HCC are identified by 
oligonucleotide microarray representing approximately 11,000 
genes from 50 hepatitis C virus (HCV)-associated HCC tissues
and 11 non-tumorous (non-cancerous and pre-cancerous) liver
tissues.

Differentiation states are divided into 5 grades.
Non-cancerous liver (L0) is the liver that is histologically
normal and is seronegative for both hepatitis B virus surface
antigen and HCV antibody. Pre-cancerous liver (L1) is the liver
that is HCV-infected and is histopathologically diagnosed as
chronic hepatitis or liver cirrhosis. Well differentiated HCC
(G1) is the HCC consisting of cancer cells that are
characterized by an increase in cell density with elevated
nuclear/cytoplasm ratios compared to normal hepatocytes but
show morphologies similar to normal hepatocytes.
Moderately differentiated HCC (G2) is the HCC consisting of
cancer cells that are large and hyperchromatic. There are
trabecular or gland-like structures in the cancer cell nests in
G2 grade. Poorly differentiated HCC (G3) is the HCC consisting
of cancer cells that are pleomorphic or multinucleate. The
tumor grows in solid masses or cell nests devoid of architectural
arrangement in G3 grade. G1, G2, and G3 tumors correspond to
types I, II, and III of the Edmondson & Steiner classification,
respectively (Edmondson, H.A. and Steiner, P.E. Primary
carcinoma of the liver: a study of 100 cases among 48,900
necropsies. Cancer 7, 462-504 (1954)).

A supervised learning method followed by a random
permutation test of oligonucleotide microarray data is used to
select genes whose expression significantly changes during the
transition from non-cancerous liver without HCV infection (L0)
to pre-cancerous liver with HCV infection (L1), from L1 to well
differentiated HCC (G1), from G1 to moderately differentiated
HCC (G2), and from G2 to poorly differentiated HCC (G3).
A self-organizing map with all the selected 40 genes whose
expression is significantly altered in each transition stage
can correctly predict the differentiation grade of tumor
tissues. Thus, these genes can be used for diagnosing the
differentiation grade of HCC and for screening anti-cancer
agents for the treatment of HCC in each differentiation grade.

Detailed Description of the Invention 

In the present invention, human hepatocellular 
carcinoma (HCC) tissues and non-tumorous (non-cancerous and 
pre-cancerous) liver tissues are used. HCCs with HCV infection 
are used for analyzing HCCs. Presence of HCV and/or HBV
infection can be determined either by immunoreactivity against
anti-HCV antibody and anti-HBV antibody or by amplifying HCV
and/or HBV genome by PCR. The differentiation grade of HCC can
be determined by histopathological examination, and HCCs are
classified into well differentiated HCC (G1), moderately
differentiated HCC (G2), and poorly differentiated HCC (G3).
Non-tumorous liver samples can be obtained from patients who
underwent hepatic resection for benign or metastatic liver
tumors. A liver sample without HCV infection is classified as
non-cancerous liver (L0), and that with HCV infection is
classified as pre-cancerous liver (L1). After resecting liver
tissues during surgery, it is preferable that tissues are 
immediately frozen in liquid nitrogen or acetone containing dry 
ice and stored at between -70 and -80°C until use. The tissues 
may or may not be embedded in O.C.T. compound (Sakura-Seiki, 
Tokyo, Japan, Catalog No. 4583). 

The expression of genes and/or proteins of HCC tissues
and non-tumorous liver tissues can be analyzed by measuring the
level of RNA and/or proteins. In most cases, the level of RNA
and/or proteins is determined by measuring fluorescence from
substances including fluorescein and rhodamine,
chemiluminescence from luminol, radioactivity of radioactive
materials including ³H, ¹⁴C, ³⁵S, ³²P, and ¹²⁵I, and optical
density. For example, the expression level of RNA and/or
proteins is determined by known methods including DNA
microarray (Schena, M. et al. Quantitative monitoring of gene
expression patterns with a complementary DNA microarray.
Science 270, 467-470 (1995) and Lipshutz, R.J. et al. High
density synthetic oligonucleotide arrays, Nat. Genet. 21,
20-24 (1999)), RT-PCR (Weis, J.H. et al. Detection of rare mRNAs
via quantitative RT-PCR, Trends Genet. 8, 263-264 (1992) and
Bustin, S.A. Absolute quantification of mRNA using real-time
reverse transcription polymerase chain reaction assays, J. Mol.
Endocrinol. 25, 169-193 (2000)), northern blotting and in situ
hybridization (Parker, R.M. and Barnes, N.M. mRNA: detection
in situ and northern hybridization. Methods Mol. Biol. 106,
247-283 (1999)), RNase protection assay (Hod, Y.A. Simplified
ribonuclease protection assay, BioTechniques 13, 852-854
(1992) and Saccomanno, C.F. et al. A faster ribonuclease
protection assay, BioTechniques 13, 846-850 (1992)), western
blotting (Towbin, H. et al. Electrophoretic transfer of
proteins from polyacrylamide gels to nitrocellulose sheets,
Proc. Natl. Acad. Sci. U.S.A. 76, 4350-4354 (1979) and Burnette,
W.N. Western blotting: Electrophoretic transfer of proteins
from sodium dodecyl sulfate-polyacrylamide gels to unmodified
nitrocellulose and radioiodinated protein A, Anal. Biochem.
112, 195-203 (1981)), ELISA assay (Engvall, E. and Perlman, P.
Enzyme-linked immunosorbent assay (ELISA): Quantitative assay
of immunoglobulin G, Immunochemistry 8, 871-879 (1971)), and
protein array (Merchant, M. and Weinberger, S.R. Review: Recent
advancements in surface-enhanced laser
desorption/ionization-time of flight-mass spectrometry.
Electrophoresis 21, 1164-1177 (2000) and Paweletz, C.P. et al.
Rapid protein display profiling of cancer progression directly
from human tissue using a protein biochip. Drug Dev. Res. 49,
34-42 (2000)).

Genes and/or proteins that are differentially expressed in
each differentiation grade of HCC and non-tumorous
(non-cancerous and pre-cancerous) liver are selected by
comparing the expression level of genes and/or proteins among
HCC tissues in each differentiation grade and non-tumorous
liver tissues. Genes and/or proteins that are differentially
expressed between non-cancerous liver (L0) and pre-cancerous
liver that has been infected with HCV (L1) are identified by
comparing the expression level of each gene and/or protein
between non-cancerous liver tissues and pre-cancerous liver
tissues. Genes and/or proteins that are differentially
expressed between pre-cancerous liver (L1) and well
differentiated HCC (G1) are identified by comparing the
expression level of each gene and/or protein between
pre-cancerous liver tissues and well differentiated HCC
tissues (HCC(G1)). Genes and/or proteins that are
differentially expressed between well differentiated HCC (G1)
and moderately differentiated HCC (G2) are identified by
comparing the expression level of each gene and/or protein
between HCC(G1) and moderately differentiated HCC tissues
(HCC(G2)). Similarly, genes and/or proteins that are
differentially expressed between moderately differentiated
HCC (G2) and poorly differentiated HCC (G3) are identified by
comparing the expression level of each gene and/or protein
between HCC(G2) and poorly differentiated HCC tissues
(HCC(G3)).

Differences in the expression level of genes and/or
proteins of non-cancerous liver, pre-cancerous liver, well
differentiated HCC, moderately differentiated HCC, and poorly
differentiated HCC can be analyzed and detected by known
methods of statistical analysis. In all experiments for
comparing the expression level of genes and/or proteins between
two grades selected from L0, L1, G1, G2, and G3, the following
procedures are taken.

In the first step, genes and/or proteins with a certain
expression level (e.g. genes with expression level greater than
40 as judged by the arbitrary units of the Affymetrix gene chip
results) in all the HCC samples and in the non-cancerous and
pre-cancerous liver samples are selected. This selection
results in a certain number of genes and/or proteins. Then, the
ability of each gene and/or protein to
discriminate L0 from L1, L1 from G1, G1 from G2, and G2 from
G3 is determined by the Fisher ratio. The Fisher ratio for a
gene j is given by

$$F(j) = \frac{\left(\hat{\mu}_j(A) - \hat{\mu}_j(B)\right)^2}{\hat{\sigma}_j^2(A) + \hat{\sigma}_j^2(B)}$$

where $\hat{\mu}_j(i)$ is the sample mean of the expression level of gene
j for the samples in Grade i, and $\hat{\sigma}_j^2(i)$ is the sample variance
of the expression level of gene j for the samples in Grade i.
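
The first step can be sketched in a few lines of Python. This is a minimal illustration, not the inventors' implementation: the function names `fisher_ratio` and `rank_genes`, the dictionary layout of the expression data, and the use of the unbiased (n - 1) sample variance are assumptions made here. The expression filter in this sketch also looks only at the two grades passed in, whereas the text applies it to all samples.

```python
from statistics import mean, variance


def fisher_ratio(values_a, values_b):
    """Fisher ratio of one gene between grades A and B.

    Each argument is a list of expression levels, one per sample.
    """
    mu_a, mu_b = mean(values_a), mean(values_b)
    # Assumption: "sample variance" is the unbiased estimator (n - 1 denominator).
    var_a, var_b = variance(values_a), variance(values_b)
    return (mu_a - mu_b) ** 2 / (var_a + var_b)


def rank_genes(expr_a, expr_b, min_level=40.0):
    """Filter genes by expression level, then rank by decreasing Fisher ratio.

    expr_a and expr_b map a (hypothetical) gene identifier to its expression
    levels in grade A and grade B. Simplification: the filter only inspects
    the two grades passed in, not every sample in the study.
    """
    kept = [g for g in expr_a if min(expr_a[g] + expr_b[g]) > min_level]
    scored = [(g, fisher_ratio(expr_a[g], expr_b[g])) for g in kept]
    return sorted(scored, key=lambda item: item[1], reverse=True)
```

A gene whose two grades have constant expression would make the denominator zero; a production version would need to handle that case explicitly.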



In the second step, the selected genes and/or proteins
are ranked in the order of decreasing magnitude of the Fisher
ratio. A random permutation test is also performed to determine
the number of genes and/or proteins to define the
differentiation grade of HCC. In the permutation test, sample
labels are randomly permuted between the two grades to be compared,
and the Fisher ratio for each gene and/or protein is again
computed. This random permutation of sample labels is repeated
1,000 times. The Fisher ratios generated from the actual data
are assigned P values based on the distribution of the Fisher
ratios from the randomized data. From this distribution, the
genes and/or proteins that are determined to be statistically
significant between the two grades by the random permutation
test are selected. More
particularly, the genes and/or proteins that have a P value
less than 0.005 by the random permutation test between the two
grades are selected. Among these selected genes and/or
proteins, the 40 genes and/or proteins having the highest Fisher
ratios in each comparison between non-cancerous liver (L0) and
pre-cancerous liver (L1), pre-cancerous liver (L1) and well
differentiated HCC (G1), well differentiated HCC (G1) and
moderately differentiated HCC (G2), and moderately differentiated
HCC (G2) and poorly differentiated HCC (G3) are further
selected.
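
The permutation step can be sketched as follows. This is an illustrative sketch under stated assumptions, not the procedure as actually run: the names `permutation_p_value` and `select_genes` are hypothetical, the P value is computed per gene from that gene's own permuted Fisher ratios (the text does not say whether the null distribution is pooled across genes), and the unbiased sample variance is assumed.

```python
import random
from statistics import mean, variance


def fisher_ratio(values_a, values_b):
    """Squared difference of means over the sum of sample variances."""
    spread = variance(values_a) + variance(values_b)
    gap = (mean(values_a) - mean(values_b)) ** 2
    if spread == 0:
        # Degenerate split (both groups constant): infinitely separated
        # unless the means coincide as well.
        return float("inf") if gap > 0 else 0.0
    return gap / spread


def permutation_p_value(values_a, values_b, n_perm=1000, seed=0):
    """Fraction of label permutations whose Fisher ratio reaches the observed one."""
    rng = random.Random(seed)
    observed = fisher_ratio(values_a, values_b)
    pooled = list(values_a) + list(values_b)
    n_a = len(values_a)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(pooled)  # shuffling the values is equivalent to permuting labels
        if fisher_ratio(pooled[:n_a], pooled[n_a:]) >= observed:
            hits += 1
    return hits / n_perm


def select_genes(expr_a, expr_b, alpha=0.005, top_n=40, n_perm=1000):
    """Keep genes with P < alpha, then return the top_n by Fisher ratio."""
    significant = [
        g for g in expr_a
        if permutation_p_value(expr_a[g], expr_b[g], n_perm) < alpha
    ]
    significant.sort(key=lambda g: fisher_ratio(expr_a[g], expr_b[g]), reverse=True)
    return significant[:top_n]
```

With 1,000 permutations the smallest non-zero P value is 0.001, so the 0.005 threshold corresponds to at most four permuted ratios reaching the observed one.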

The ability of the selected 40 genes and/or proteins to
distinguish non-cancerous liver (L0) from pre-cancerous liver
(L1), pre-cancerous liver (L1) from well differentiated HCC
(G1), well differentiated HCC (G1) from moderately
differentiated HCC (G2), and moderately differentiated HCC (G2)
from poorly differentiated HCC (G3) is verified by the minimum
distance classifier and the self-organizing map (SOM).

WO 2004/090163
PCT/JP2003/004458

The minimum distance classifier is designed using the 40
genes and/or proteins selected in each transition stage. The
expression level of each gene and/or protein is normalized to
have zero mean and unit variance using all the training samples
from two grades. After measuring the Euclidean distance
between a sample and each mean vector, the sample is assigned
to the grade of the nearest mean vector. The minimum distance
classifier that is created with the selected 40 genes and/or
proteins in each transition stage is also used to predict the
differentiation grade of HCC samples whose differentiation
grade is not determined. To diagnose the differentiation grade
of HCCs, using $\hat{\mu}_j(A)$ and $\hat{\mu}_j(B)$ previously described, the
sample mean $\hat{\mu}_j$ of the mixture consisting of Grades A and B on
a gene j is obtained by

$$\hat{\mu}_j = \frac{N_A \hat{\mu}_j(A) + N_B \hat{\mu}_j(B)}{N_A + N_B}$$

where $N_i$ is the number of samples from Grade i. Next, the
sample variance $\hat{\sigma}_j^2$ of the mixture consisting of Grades A and
B on the gene j is obtained by

[equation not legible in the source]

Using $\hat{\mu}_j$ and $\hat{\sigma}_j^2$, $\hat{\mu}$ and V are defined by

$$\hat{\mu} = [\hat{\mu}_1, \hat{\mu}_2, \ldots, \hat{\mu}_{40}]^T, \qquad V = \mathrm{diag}(\hat{\sigma}_1^2, \hat{\sigma}_2^2, \ldots, \hat{\sigma}_{40}^2)$$

Then, a sample x is normalized by

$$\tilde{x} = V^{-1/2}(x - \hat{\mu})$$

where $\tilde{x}$ is the normalized sample. Using the normalized
samples, the sample mean vector $\tilde{\mu}_i$ for each grade i is obtained. In
the minimum distance classifier, the score value is computed
by

$$T_1(\tilde{x}) = \|\tilde{x} - \tilde{\mu}_{L0}\|^2 - \|\tilde{x} - \tilde{\mu}_{L1}\|^2$$
$$T_2(\tilde{x}) = \|\tilde{x} - \tilde{\mu}_{L1}\|^2 - \|\tilde{x} - \tilde{\mu}_{G1}\|^2$$
$$T_3(\tilde{x}) = \|\tilde{x} - \tilde{\mu}_{G1}\|^2 - \|\tilde{x} - \tilde{\mu}_{G2}\|^2$$
$$T_4(\tilde{x}) = \|\tilde{x} - \tilde{\mu}_{G2}\|^2 - \|\tilde{x} - \tilde{\mu}_{G3}\|^2$$

Using four minimum distance classifiers, the differentiation
grade of HCCs can be diagnosed as follows:

(i) A normalized sample $\tilde{x}$ is classified into Grade L0
if $T_1(\tilde{x}) < 0$, $T_2(\tilde{x}) < 0$, $T_3(\tilde{x}) < 0$ and $T_4(\tilde{x}) < 0$.

(ii) A normalized sample $\tilde{x}$ is classified into Grade L1
if $T_1(\tilde{x}) > 0$, $T_2(\tilde{x}) < 0$, $T_3(\tilde{x}) < 0$ and $T_4(\tilde{x}) < 0$.

(iii) A normalized sample $\tilde{x}$ is classified into Grade G1
if $T_1(\tilde{x}) > 0$, $T_2(\tilde{x}) > 0$, $T_3(\tilde{x}) < 0$ and $T_4(\tilde{x}) < 0$.

(iv) A normalized sample $\tilde{x}$ is classified into Grade G2
if $T_1(\tilde{x}) > 0$, $T_2(\tilde{x}) > 0$, $T_3(\tilde{x}) > 0$ and $T_4(\tilde{x}) < 0$.

(v) A normalized sample $\tilde{x}$ is classified into Grade G3
if $T_1(\tilde{x}) > 0$, $T_2(\tilde{x}) > 0$, $T_3(\tilde{x}) > 0$ and $T_4(\tilde{x}) > 0$.
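
The scoring and the five decision rules can be sketched in Python as below. This is a hedged illustration only: `fit_pair_classifier` and `diagnose` are hypothetical names, the pooled variance is assumed to be the unbiased sample variance of the combined training samples, and returning `None` for a zero score or for a sign pattern not covered by rules (i) to (v) is a choice made here, since the text does not say how such samples are handled.

```python
import math
from statistics import mean, variance

GRADES = ["L0", "L1", "G1", "G2", "G3"]


def fit_pair_classifier(samples_a, samples_b):
    """Return a score function T(x) for two adjacent grades A and B.

    Each sample is a list of expression levels in a fixed gene order.
    T(x) < 0 means x is nearer the grade A mean, T(x) > 0 nearer grade B.
    """
    pooled = samples_a + samples_b
    n_genes = len(pooled[0])
    mu = [mean([s[j] for s in pooled]) for j in range(n_genes)]
    # Assumption: unbiased variance of the pooled training samples.
    sd = [math.sqrt(variance([s[j] for s in pooled])) for j in range(n_genes)]

    def normalize(x):
        return [(x[j] - mu[j]) / sd[j] for j in range(n_genes)]

    def grade_mean(samples):
        z = [normalize(s) for s in samples]
        return [mean([row[j] for row in z]) for j in range(n_genes)]

    mean_a, mean_b = grade_mean(samples_a), grade_mean(samples_b)

    def sq_dist(u, v):
        return sum((ui - vi) ** 2 for ui, vi in zip(u, v))

    def score(x):
        z = normalize(x)
        return sq_dist(z, mean_a) - sq_dist(z, mean_b)

    return score


def diagnose(scores):
    """Map [T1, T2, T3, T4] to a grade, or None for an unlisted sign pattern."""
    if any(t == 0 for t in scores):
        return None
    positive = [t > 0 for t in scores]
    k = sum(positive)
    # Rules (i) to (v) are exactly the patterns with k leading positive scores.
    if positive != [True] * k + [False] * (len(scores) - k):
        return None
    return GRADES[k]
```

In use, one score function would be fitted per transition (L0/L1, L1/G1, G1/G2, G2/G3), each on its own set of 40 genes, and the four scores for a new sample passed to `diagnose`.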

The SOM is a neural network algorithm widely used for
clustering and is well known as an efficient tool for the
visualization of multidimensional data (Tamayo, P. et al.
Interpreting patterns of gene expression with self-organizing
maps: methods and application to hematopoietic differentiation,
Proc. Natl. Acad. Sci. U.S.A. 96, 2907-2912 (1999) and Sultan,
M. et al. Binary tree-structured vector quantization approach
to clustering and visualizing microarray data, Bioinformatics
Suppl 1, S111-S119 (2002)). The SOM with all the selected 40
genes and/or proteins is carried out in MATLAB R13 with the
SOM toolbox available at the web site
http://www.cis.hut.fi/projects/somtoolbox/ (Kohonen, 2001).
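
For readers unfamiliar with the algorithm, the core SOM update can be sketched in plain Python. This is not the MATLAB SOM Toolbox and does not reproduce its defaults: the function names, the linear decay of the learning rate and neighborhood width, the Gaussian neighborhood, and the initialization from randomly chosen samples are all assumptions made for illustration.

```python
import math
import random


def best_matching_unit(weights, x):
    """Return the grid cell (row, col) whose weight vector is closest to x."""
    return min(
        weights,
        key=lambda cell: sum((w - v) ** 2 for w, v in zip(weights[cell], x)),
    )


def train_som(data, rows, cols, n_iter=500, lr0=0.5, seed=0):
    """Train a rows x cols map on data (a list of equal-length vectors)."""
    rng = random.Random(seed)
    dim = len(data[0])
    sigma0 = max(rows, cols) / 2
    # Each cell starts as a copy of a randomly chosen sample.
    weights = {
        (r, c): list(rng.choice(data)) for r in range(rows) for c in range(cols)
    }
    for t in range(n_iter):
        frac = t / n_iter
        lr = lr0 * (1 - frac)                     # learning rate decays linearly
        sigma = max(sigma0 * (1 - frac), 0.5)     # neighborhood width shrinks
        x = rng.choice(data)
        bmu = best_matching_unit(weights, x)
        for cell, w in weights.items():
            grid_d2 = (cell[0] - bmu[0]) ** 2 + (cell[1] - bmu[1]) ** 2
            h = math.exp(-grid_d2 / (2 * sigma ** 2))
            for j in range(dim):
                # Pull the cell toward the sample, more strongly near the BMU.
                w[j] += lr * h * (x[j] - w[j])
    return weights
```

After training, each sample is assigned to the cell returned by `best_matching_unit`, and samples sharing a cell form one cluster; neighboring cells tend to hold similar weight vectors.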

Each set of forty genes and/or proteins whose expression
is significantly altered during the transition from
non-cancerous liver (L0) to pre-cancerous liver (L1), from
pre-cancerous liver (L1) to well differentiated HCC (G1), from
well differentiated HCC (G1) to moderately differentiated HCC
(G2), and from moderately differentiated HCC (G2) to poorly
differentiated HCC (G3) is used for diagnosing the grade of
hepatocarcinogenesis of HCC, and also for screening
anti-cancer agents that are used for the treatment of HCC in
each grade.

Each set of forty genes and/or proteins whose expression
is significantly altered during the transition from
non-cancerous liver (L0) to pre-cancerous liver (L1), from
pre-cancerous liver (L1) to well differentiated HCC (G1), from
well differentiated HCC (G1) to moderately differentiated HCC
(G2), and from moderately differentiated HCC (G2) to poorly
differentiated HCC (G3) is expressed in bacteria, eukaryotic
cells, and cell-free systems. Agents that affect the
expression and/or function of the genes and/or proteins are
screened by monitoring the expression and/or function.
Monoclonal antibodies against the proteins are also raised and
used for treating HCC in different grades. As monoclonal
antibodies, whole mouse monoclonal antibodies, humanized
antibodies, chimeric antibodies, single chain antibodies,
divalent single chain antibodies, and/or bi-specific
antibodies can be raised against the purified proteins, and
they are used for diagnosing the grade of HCC and the treatment
thereof.

A kit to examine the expression of the genes and/or
proteins is also created. The kit consists of components
including reagents for RNA extraction, enzymes for synthesis
of cDNA and cRNA, DNA chips, oligonucleotide chips, protein
chips, probes and primers for the genes, DNA fragments of
control genes, and antibodies to the proteins. The components
of the kit are easily available from the market.

Brief Description of the Drawings 
Fig. 1 illustrates color displays of the expression of 152 genes
whose expression was significantly altered during the
transition from L0 to L1 (a), 191 genes whose expression was
significantly altered during the transition from L1 to G1 (b),
54 genes whose expression was significantly altered during the
transition from G1 to G2 (c), and 40 genes whose expression was
significantly altered during the transition from G2 to G3 (d).
Panels e, f, g, and h illustrate expression of the selected 40
genes in each transition stage in all the samples. Expression
of the selected 40 genes whose expression was significantly
altered during the transition from L0 to L1 (e), from L1 to G1
(f), from G1 to G2 (g), and from G2 to G3 (h) is shown. The
selected 40 genes in each transition stage discriminate samples
before and after the transition. Genes are shown in decreasing
order of the Fisher ratio and are indicated by GenBank accession
numbers.

The name of each sample is indicated on top of each photo (e-h):
NL-64, NL-65, NL-66, NL-67, NL-68, NL-69, IL-49, IL-58, IL-59,
IL-60, IL-62, G1-26T, G1-42T, G1-85T, G1-86T, G1-87T, G1-147T,
G1-165T, G2-1T, G2-2T, G2-6T, G2-8T, G2-10T, G2-12T, G2-16T,
G2-18T, G2-20T, G2-22T, G2-23T, G2-27T, G2-28T, G2-29T, G2-31T,
G2-34T, G2-37T, G2-43T, G2-45T, G2-46T, G2-49T, G2-58T, G2-59T,
G2-60T, G2-62T, G2-89T, G2-90T, G2-105T, G2-151T, G2-155T,
G2-161T, G2-162T, G2-163T, G2-171T, G2-182T, G3-19T, G3-21T,
G3-25T, G3-35T, G3-80T, G3-81T, G3-107T, G3-174T, from the
left.

The name of each gene is indicated on the right of the photo.
In the case of panel e, M18533, AF035316, AL049942, L27479,
"Fibronectin, Alt. Splice 1", U19765, X55503, AL046394,
AB007886, AL050139, AF012086, AI539439, M19828, U92315, D76444,
X02761, AF001891, AI400326, AI362017, L13977, D32053, AF038962,
AL008726, J03909, Z69043, AL080080, M63138, L09159, AF017115,
M13560, M36035, U47101, U81554, M21185, D32129, AL022723,
M83664, U50523, M81757, AF102803, from the top. In the case
of panel f, M93221, AF079221, V01512, D88587, U12022, AF055376,
R93527, R92331, U83460, AF052113, H68340, M10943, M13485,
U75744, X02544, M93311, Z24725, U22961, M62403, M35878, U84011,
AF055030, L13977, D13891, M63175, AB023157, U20982, M14058,
AL049650, U61232, AI991040, U64444, D63997, X55503, AL080181,
X76228, AB018330, D76444, U70660, U10323, from the top. In the
case of panel g, M87434, M12963, AI625844, M97936, Z99129,
L07633, D50312, U07364, AA883502, M97935, AF061258, AB007447,
M97935, W28281, M97935, Y00281, D28118, AF104913, AA675900,
L27706, D32050, M63573, AF014398, X70944, U70671, AA447263,
AB014569, M23115, D38521, X00351, L11672, X82834, AB007963,
U76247, X68560, AB015344, AB018327, AF004430, D14697, AB028449,
from the top. In the case of panel h, AA976838, Z11793, AB002311,
Y18004, AL031230, AF002697, AB014596, U49897, AF070570, M80482,
AI263099, U22961, Z24725, U77594, L34081, M88458, U68723,
X92098, D10040, AB023194, AF001903, X96752, AB006202, M75106,
Y12711, D14662, S87759, Z48199, AF088219, AA453183, D31767,
AB000095, AB006782, M21186, AB002312, U44772, AI541308, Z49107,
U77735, M38449, from the top.

Fig. 2 illustrates the validation of the selected 40 genes in
each transition stage to distinguish the differentiation grade
of HCC.

In each transition, from L0 to L1 (a), from L1 to G1 (b), from
G1 to G2 (c), and from G2 to G3 (d), the minimum distance
classifier was constructed with the samples in two consecutive
differentiation grades as indicated by the red bar (training
samples), and was applied to the samples in the remaining
differentiation grades as indicated by the black bar (test
samples). The resulting classifier classified the test samples
with an accuracy of 92% (a), 98% (b), 84% (c), and 100% (d).

Fig. 3 illustrates the result of analysis by the
self-organizing map (SOM) algorithm of the genes whose
expression changed during the transition from non-cancerous
liver (L0) to pre-cancerous liver (L1), from pre-cancerous
liver (L1) to well differentiated HCC (G1), from well
differentiated HCC (G1) to moderately differentiated HCC (G2),
and from moderately differentiated HCC (G2) to poorly
differentiated HCC (G3).

Fig. 3a illustrates clusters of the samples (Table 1).
Each cell in the SOM grid corresponds to one cluster. The
vectors of neighboring cells are usually located close to each
other.

(m, n), index of the cell located at m-th row and n-th column.
NL-XX, samples from non-cancerous liver without HCV infection
(L0); IL-XX, samples from HCV-infected pre-cancerous liver
(L1); G1-XXT, samples from well differentiated HCC (G1); G2-XXT,
samples from moderately differentiated HCC (G2); G3-XXT,
samples from poorly differentiated HCC (G3).
The map shows that the samples clearly formed a sigmoid curve
in the order of L0, L1, G1, G2, and G3. G2 samples without vessel
involvement (blue letters) are located close to G1 samples and
G2 samples with vessel involvement (red letters) are located
close to G3 samples.

Fig. 3b illustrates the distance between the neighboring
clusters.

(m, n), index of the cell located at m-th row and n-th column.
The color of the cells indicates the distance between the
neighboring clusters; a red color means a long distance. The
red cells in the upper area clearly show that the non-tumorous
(non-cancerous and pre-cancerous) liver samples and HCC
samples are relatively far apart in all the selected 40 genes.

Table 1 illustrates clusters of samples profiled to L0, L1, G1,
G2, and G3 as shown in Fig. 3a.

Table 2 illustrates clinicopathologic factors of the HCC used
in the present invention.
Table 3 illustrates top-40 discriminatory genes in L0 and L1.
Table 4 illustrates top-40 discriminatory genes in L1 and G1.
Table 5 illustrates top-40 discriminatory genes in G1 and G2.
Table 6 illustrates top-40 discriminatory genes in G2 and G3.

Best Mode for Carrying out the Invention 

The following examples merely illustrate the preferred
method for identification and use of genes and/or proteins that
are differentially expressed in non-cancerous liver,
pre-cancerous liver, well differentiated HCC, moderately
differentiated HCC, and poorly differentiated HCC.

Hereinbelow, the present invention will be
specifically described using examples; however, it is not to
be construed as being limited thereto.

Example 1. Preparation of human tissues 

Fifty patients underwent surgical treatment for HCC at
Yamaguchi University Hospital between May 1997 and August 2000.
Written informed consent was obtained from all patients before
surgery. The study protocol was approved by the Institutional
Review Board for the Use of Human Subjects at the Yamaguchi
University School of Medicine. All of the 50 patients were
seropositive for HCV antibody (HCVAb) and seronegative for
hepatitis B virus surface antigen (HBsAg). A histopathological
diagnosis of HCC was made in all cases after surgery. This
histopathological examination showed that seven patients had
well differentiated HCC (G1), 35 had moderately differentiated
HCC (G2), and the remaining eight had poorly differentiated HCC
(G3). Clinicopathologic factors were determined according to
the International Union against Cancer TNM classification.
Fisher's exact test, Student's t test, and Mann-Whitney's U
test were used to elucidate the differences in
clinicopathologic characteristics among the three grades, G1, G2,
and G3 HCC. P<0.05 was considered significant.

Six non-cancerous liver samples were obtained from six
patients who underwent hepatic resection for benign or
metastatic liver tumors, and were confirmed to have histologically
normal livers. They were all seronegative for both HBsAg and
HCVAb. Five HCV-infected liver samples were also prepared from
the non-tumorous areas of five patients with HCC. All five
liver samples were histopathologically diagnosed as chronic
hepatitis or liver cirrhosis. Informed consent in writing was
obtained from all patients before surgery.

Example 2. Clinicopathologic characteristics of HCCs

Histological examinations showed that, among the 50
HCV-associated HCCs enrolled in this study, seven were well
differentiated HCC (G1), 35 were moderately differentiated HCC
(G2), and the remaining eight were poorly differentiated HCC
(G3) (Table 2). The tumor size of G2 and G3 HCCs was
significantly larger than that of G1 HCC (p=0.0007 and p=0.028,
respectively, by Mann-Whitney's U test). The incidence of
vessel involvement in G2 and G3 HCCs was significantly higher
than that in G1 HCC (p=0.038 by Fisher's exact test). In
parallel to dedifferentiation from G1 to G3, tumor stage was
more advanced (p=0.066 by Fisher's exact test). Thus, each type
of G1, G2, and G3 HCCs enrolled in this study showed
characteristics corresponding to dedifferentiation, i.e.,
tumor size, metastatic potential, and tumor stage, as proposed
by Kojiro (Kojiro, M. Pathological evolution of early
hepatocellular carcinoma. Oncology 62, 43-47 (2002)).

Example 3. Extraction of the RNA from tissues 

Pieces of the tissues (about 125 mm³) were suspended in
TRIZOL (Life Technologies, Gaithersburg, USA, Catalog No.
15596-018) or Sepasol-RNAI (Nacalai tesque, Kyoto, Japan,
Catalog No. 306-55) and homogenized twice with a Polytron
(Kinematica, Littau, Switzerland) (5 sec at maximum speed).
After addition of chloroform, the tissue homogenates were
centrifuged at 15,000 x g for 10 min, and the aqueous phases, which
contained RNA, were collected. Total cellular RNA was
precipitated with isopropyl alcohol, washed once with 70%
ethanol, and suspended in DEPC-treated water (Life
Technologies, Gaithersburg, USA, Catalog No. 10813-012).
After treatment with 1.5 units of DNase I (Life Technologies,
Gaithersburg, USA, Catalog No. 18068-015), the RNA was
re-extracted with TRIZOL/chloroform, precipitated with
ethanol, and dissolved in DEPC-treated water. Thereafter,
small molecular weight nucleotides were removed by using the RNeasy
Mini Kit (QIAGEN, Hilden, Germany, Catalog No. 74104) according
to the manufacturer's instruction manual. Quality of the total
RNA was judged from the ratio of 28S and 18S ribosomal RNA after
agarose gel electrophoresis. The purified total RNA was stored
at -80 °C in 70% ethanol solution until use.

Example 4. Synthesis of cDNA and labeled cRNA probes

cDNA was synthesized by using the SuperScript
Choice System (Life Technologies, Gaithersburg, USA, Catalog
No. 18090-019) according to the manufacturer's instruction
manual. Five micrograms of the purified total RNA were
hybridized with oligo-dT primers (Sawady Technology, Tokyo,
Japan) that contained sequences for the T7 promoter and 200
units of SuperScript II reverse transcriptase and incubated at
42 °C for 1 hr. The resulting cDNA was extracted with
phenol/chloroform and purified with Phase Lock Gel™ Light
(Eppendorf, Hamburg, Germany, Catalog No. 0032 005.101).

cRNA was also synthesized by using the MEGAscript T7 kit
(Ambion, Austin, USA, Catalog No. 1334) and cDNA as templates
according to the manufacturer's instruction. Approximately 5
µg of the cDNA was incubated with 2 µl of enzyme mix containing
T7 polymerase, 7.5 mM each of adenosine triphosphate (ATP) and
guanosine triphosphate (GTP), 5.625 mM each of cytidine
triphosphate (CTP) and uridine triphosphate (UTP), and 1.875
mM each of Bio-11-CTP and Bio-16-UTP (ENZO Diagnostics,
Farmingdale, USA, Catalog No. 42818 and 42814, respectively)
at 37 °C for 6 hr. Mononucleotides and short oligonucleotides
were removed by column chromatography on a CHROMA SPIN +STE-100
column (CLONTECH, Palo Alto, USA, Catalog No. K1302-2), and the
cRNA in the eluates was sedimented by adding ethanol. Quality
of the cRNA was judged from the length of the cRNA after agarose
gel electrophoresis. The purified cRNA was stored at -80 °C
in 70% ethanol solution until use.

Example 5. Gene expression analysis of HCC in different
differentiation grade

Gene expression of human primary tumors from HCC
patients was examined by high-density oligonucleotide
microarrays (U95A array, Affymetrix, Santa Clara, USA, Catalog
No. 510137) (Lipshutz, R.L. et al. High density synthetic
oligonucleotide arrays, Nat. Genet. 21, 20-24 (1999)). For
hybridization with oligonucleotides on the chips, the cRNA was
fragmented at 95 °C for 35 min in a buffer containing 40 mM Tris
(Sigma, St. Louis, USA, Catalog No. T1503)-acetic acid (Wako,
Osaka, Japan, Catalog No. 017-00256) (pH 8.1), 100 mM potassium
acetate (Wako, Osaka, Japan, Catalog No. 160-03175), and 30 mM
magnesium acetate (Wako, Osaka, Japan, Catalog No. 130-00095).
Hybridization was performed in 200 µl of a buffer containing
0.1 M 2-(N-Morpholino)ethanesulfonic acid (MES) (Sigma, St.
Louis, USA, Catalog No. M-3885) (pH 6.7), 1 M NaCl (Nacalai
tesque, Kyoto, Japan, Catalog No. 313-20), 0.01%
polyoxyethylene (10) octylphenyl ether (Wako, Osaka, Japan, Catalog
No. 168-11805), 20 µg herring sperm DNA (Promega, Madison, USA,
Catalog No. D181B), 100 µg acetylated bovine serum albumin
(Sigma, St. Louis, USA, Catalog No. B-8894), 10 µg of the
fragmented cRNA, and biotinylated-control oligonucleotides,
biotin-5'-CTGAACGGTAGCATCTTGAC-3' (Sawady Technology, Tokyo,
Japan), at 45 °C for 12 hr. After washing the chips with a buffer
containing 0.01 M MES (pH 6.7), 0.1 M NaCl, and 0.001%
polyoxyethylene (10) octylphenyl ether, the chips were
incubated with biotinylated anti-streptavidin antibody
(Funakoshi, Tokyo, Japan, Catalog No. BA0500) and stained with
streptavidin R-Phycoerythrin (Molecular Probes, Eugene, USA,
Catalog No. S-866) to increase hybridization signals as
described in the instruction manual (Affymetrix, Santa Clara,
USA). Each pixel level was collected with a laser scanner
(Affymetrix, Santa Clara, USA), and levels of the expression of
each cDNA and reliability (Present/Absent call) were
calculated with Affymetrix GeneChip ver. 3.3 and Affymetrix
Microarray Suite ver. 4.0 software. From these experiments,
expression of approximately 11,000 genes in the human primary
tumors of HCC patients was determined.

Example 6. Statistical analysis of the oligonucleotide
microarray data

Genes with average differences greater than 40
(arbitrary units by Affymetrix) in all the 50 HCC samples and
the 11 non-tumorous (non-cancerous and pre-cancerous) liver
samples were selected. This procedure yielded 3,559 genes out
of approximately 11,000. Next, the Fisher ratio was determined
(Iizuka, N., Oka, M., Yamada-Okabe, H., Mori, N., Tamesa, T.,
Okada, T., Takemoto, T., Tangoku, A., Hamada, K., Nakayama, H.,
Miyamoto, T., Uchimura, S., and Hamamoto, Y. Comparison of gene
expression profiles between hepatitis B virus- and hepatitis
C virus-infected hepatocellular carcinoma by oligonucleotide
microarray data based on a supervised learning method. Cancer
Res. 62, 3939-3944 (2002) and Luo, J., Duggan, D.J., Chen, Y.,
Sauvageot, J., Ewing, C.M., Bittner, M.L., Trent, J.M., and
Isaacs, W.B. Human prostate cancer and benign prostatic
hyperplasia: molecular dissection by gene expression profiling.
Cancer Res. 61, 4683-4688 (2001)) to evaluate these genes as
discriminators of L0 from L1, L1 from G1, G1 from G2, and G2
from G3. The above 3,559 genes were ranked in the order of
decreasing magnitude of the Fisher ratio. A random permutation
test was also performed to determine the number of genes to
define the differentiation grade of HCC. The random
permutation test was carried out as described previously
(Iizuka, N., Oka, M., Yamada-Okabe, H., Mori, N., Tamesa, T.,
Okada, T., Takemoto, T., Tangoku, A., Hamada, K., Nakayama, H.,
Miyamoto, T., Uchimura, S., and Hamamoto, Y. Comparison of gene
expression profiles between hepatitis B virus- and hepatitis
C virus-infected hepatocellular carcinoma by oligonucleotide
microarray data based on a supervised learning method. Cancer
Res. 62, 3939-3944 (2002) and Luo, J., Duggan, D.J., Chen, Y.,
Sauvageot, J., Ewing, C.M., Bittner, M.L., Trent, J.M., and
Isaacs, W.B. Human prostate cancer and benign prostatic
hyperplasia: molecular dissection by gene expression profiling.
Cancer Res. 61, 4683-4688 (2001)). In the test, sample labels
were randomly permuted between the two grades to be considered, and
the Fisher ratio for each gene was again computed. This random
permutation of sample labels was repeated 1,000 times. The
Fisher ratios generated from the actual data were then assigned
P values based on the distribution of the Fisher ratios from
randomized data. From the distribution of the Fisher ratios
based on the randomized data, all of the genes that could pass
the random permutation test (P<0.005) were selected. This
procedure was performed in all experiments for the comparison
of two grades. As a result, 152 genes with Fisher ratios
higher than 4.90 were statistically significant discriminators
between L0 and L1. Likewise, 191 genes with Fisher ratios
higher than 4.08 to discriminate L1 from G1, 54 genes with
Fisher ratios higher than 1.52 to discriminate G1 from G2, and
40 genes with Fisher ratios higher than 1.34 to discriminate
G2 from G3 were identified.
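
The gene-selection steps of this example can be sketched in code. The following is a minimal illustration only, not the inventors' implementation. It assumes the common two-class form of the Fisher ratio, F = (m1 - m2)^2 / (v1 + v2), computed with population variances; the text above does not spell out the formula, and the cited references give the definition actually used. The function names, the dictionary layout of the expression data, and the (hits + 1) / (n + 1) estimator for P are likewise assumptions made for illustration.

```python
import random
from statistics import mean, pvariance


def filter_expressed(expr, threshold=40.0):
    """Keep genes whose value exceeds `threshold` in every sample.

    `expr` maps a gene name to its list of per-sample values
    (a hypothetical layout chosen for this sketch).
    """
    return {g: v for g, v in expr.items() if all(x > threshold for x in v)}


def fisher_ratio(a, b):
    """Assumed form: squared mean difference over summed variances."""
    denom = pvariance(a) + pvariance(b)
    diff2 = (mean(a) - mean(b)) ** 2
    if denom == 0:
        return float("inf") if diff2 > 0 else 0.0
    return diff2 / denom


def permutation_p(a, b, n_perm=1000, seed=0):
    """Estimate P by randomly permuting sample labels between two grades."""
    rng = random.Random(seed)
    observed = fisher_ratio(a, b)
    pooled = list(a) + list(b)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(pooled)
        if fisher_ratio(pooled[:len(a)], pooled[len(a):]) >= observed:
            hits += 1
    # (hits + 1) / (n_perm + 1) is an assumed estimator, not from the text.
    return (hits + 1) / (n_perm + 1)


def select_genes(expr_a, expr_b, alpha=0.005, n_perm=1000):
    """Return genes with P < alpha, ordered by decreasing Fisher ratio."""
    scored = []
    for gene in expr_a:
        f = fisher_ratio(expr_a[gene], expr_b[gene])
        p = permutation_p(expr_a[gene], expr_b[gene], n_perm)
        if p < alpha:
            scored.append((f, gene))
    return [gene for f, gene in sorted(scored, reverse=True)]
```

In this sketch, `expr_a` and `expr_b` would hold the values of the same genes in the two grades being compared (for example L0 and L1), and the procedure would be repeated for each pair of consecutive grades.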

Example 7. Selection of genes whose expression correlates with
differentiation grade of HCC

With oligonucleotide array data, changes in the gene
expression during oncogenesis, i.e., from non-cancerous liver
(L0) to HCV-infected pre-cancerous liver (L1) and from L1 to
well differentiated HCC (G1), and during dedifferentiation of
HCC (G1 to G2 and G2 to G3) were analyzed. The supervised
learning method followed by a random permutation test
identified 152 genes whose expression level was significantly
changed during the transition from L0 to L1. Among the 152 genes,
67 were upregulated and 85 were downregulated during this
transition. In the same manner, 191 genes whose expression
level was significantly changed during the transition from L1
to G1 HCC were identified. Among the 191 genes, 95 were
upregulated and 96 were downregulated during this transition.
Fifty-four genes appeared to be differentially expressed
between G1 and G2 HCCs, and among them the expression of 36 genes
was increased and that of 18 genes was decreased during the
transition from G1 to G2. Forty genes turned out to be
differentially expressed between G2 and G3 HCCs, and among them
the expression of 10 genes was increased and that of 30 genes
was decreased during the transition from G2 to G3.

To examine performance of the genes selected in each
grade in the oncogenesis and development of HCC, the inventors
applied data of these genes to all samples. As a result, almost
all of these genes selected in each transition stage were placed
in the L0-L1 transition, L1-G1 transition, G1-G2 transition, and
G2-G3 transition. For example, the 191 genes that discriminate
L1 from G1 HCC could clearly distinguish non-tumorous livers
(L0 and L1) from HCCs (G1, G2, and G3) (Fig. 1). These results
indicate that altered levels of the selected genes play central
roles in determining each grade of HCC pathogenesis.

Example 8. Genes whose expression changed during the transition
from non-cancerous liver (L0) to pre-cancerous liver (L1)

Expression of most of immune response-related genes,
metabolism-related genes, transport-related genes,
proteolysis-related genes, and oncogenesis-related genes was
increased, and that of transcription-related genes was
decreased during the transition from L0 to L1 (Table 3).

Immune response-related genes include MHC class I
family (HLA-A, -C, -E, and -F), MHC class II family (HLA-DPB1
and HLA-DRA), CD74, NK4, LILRB1, FCGR3B, and IFI30.
Upregulation of an interferon (IFN) inducible gene such as
IFI30 may represent host defense against viral infection;
however, it should be noted that several IFN-related genes were
decreased during dedifferentiation of G1 to G2 as mentioned in
the following section (see Example 10).

Metabolism-related genes include KARS, ALDOA, ASAH, MPI,
and GAPD. Increased levels of KARS and ALDOA enhance protein
biosynthesis and glycolysis, respectively. Upregulation of
ASAH, MPI, and GAPD augments biosynthesis of fatty acid,
mannose, and glyceraldehyde, respectively.

Transport-related genes include VDAC3, SSR4, BZRP, and
ATOX1. SSR4 is responsible for the effective transport of newly
synthesized polypeptides. ATOX1 is a copper transporter and
an increase in its expression causes activation of various
metabolic pathways, because many enzymes require copper ion as
a cofactor of enzymatic activity.

Proteolysis-related genes include CST3 and CTSD. CST3
is involved in vascular formation. Increased serum level of
CTSD protein was observed in cirrhotic patients who may develop
pre-cancerous hepatic nodules (Leto, G., Tumminello, F.M.,
Pizzolanti, G., Montalto, G., Soresi, M., Ruggeri, I., and
Gebbia, N. Cathepsin D serum mass concentrations in patients
with hepatocellular carcinoma and/or liver cirrhosis, Eur. J.
Clin. Chem. Clin. Biochem. 34, 555-560 (1996)).

Oncogenesis-related genes include MBD2, RPS19, RPS3,
RPS15, and RPS12. DNA methylation is a common epigenetic change
in many malignancies, and DNA methylation patterns are
determined by the enzymatic processes of methylation and
demethylation. Upregulation of MBD2, which inhibits
transcription from methylated DNA, plays an important role in
downregulation of tumor suppressor genes carrying methylated
DNA at their promoter regions.

Downregulation of a transcription-related gene, RB1CC1,
was observed during the transition from L0 to L1. The RB1CC1
protein is a major regulator of the tumor suppressor gene RB1;
decreased levels of RB1CC1 can thereby promote oncogenesis via
decreased activity of RB1 protein.

Thus, HCV-infected pre-cancerous liver is
characterized by the altered expression of these genes, which
suggests that initiation of hepatocarcinogenesis occurs during
HCV infection. Among genes whose expression changes during the
transition from L0 to L1, those involved in proteolysis and
oncogenesis may serve as molecular targets for chemoprevention
of HCV-associated HCC.

WO 2004/090163    PCT/JP2003/004458

Example 9. Genes whose expression changed during the transition
from pre-cancerous liver (L1) to well differentiated HCC (G1)

Genes whose expression was altered during the
transition from L1 to G1 include most oncogenesis-related genes,
signal transduction-related genes, transcription-related
genes, transport-related genes, detoxification-related genes,
and immune response-related genes (Table 4).

Oncogenesis-related genes such as BNIP3L, FOS, MAF, and
IGFBP3 that can induce apoptosis of some cancer cells and IGFBP4
that acts as an inhibitor of IGF-induced cell proliferation
were downregulated during the transition, indicating that
downregulation of these genes is also important for the
promotion of hepatocarcinogenesis. Previous reports also
showed the decreased expression of IGFBP3 and IGFBP4 in HCC
compared with non-tumorous liver (Okabe, H., Satoh, S., Kato,
T., Kitahara, O., Yanagawa, R., Yamaoka, Y., Tsunoda, T.,
Furukawa, Y., and Nakamura, Y. Genome-wide analysis of gene
expression in human hepatocellular carcinomas using cDNA
microarray: identification of genes involved in viral
carcinogenesis and tumor progression, Cancer Res. 61,
2129-2137 (2001) and Delpuech, O., Trabut, J.B., Carnot, F.,
Feuillard, J., Brechot, C., and Kremsdorf, D. Identification,
using cDNA macroarray analysis, of distinct gene expression
profiles associated with pathological and virological features
of hepatocellular carcinoma. Oncogene 21, 2926-2937 (2002)).
The data of the present invention provide the additional insight
that downregulation of these two genes has already occurred in
well differentiated HCC. MAF functions as a regulator for cell
differentiation. BNIP3L induces cell apoptosis via inhibiting
activity of BCL2. In some cases, expression of FOS seems to
be associated with apoptotic cell death. Thus, downregulation
of these five genes is likely to trigger the transformation of
hepatocytes after chronic HCV infection.

Signal transduction-related genes such as CAMKK2, GMFB,
RALBP1, CDIPT, ZNF259, and RAC1, and transcription-related
genes such as DRAP1, ILF2, BMI1, and PMF1 were upregulated
during the transition from L1 to G1. Other signal
transduction-related genes such as CALM1, RAB14, TYROBP, and
MAP2K1 were downregulated during this transition.
Downregulation of TYROBP in G1 HCC may reflect decreased immune
response. Alteration of the expression of genes involved in
various signal transduction pathways may reflect a true
portrait of well differentiated HCC arising from HCV-infected
pre-cancerous liver.

Transport-related genes such as TBCE, ATP6V1E, ATOX1,
and SEC61G were upregulated, and those such as SLC31A1 and DDX19
were downregulated during the transition from L1 to G1. ATOX1,
which is an intracellular copper transporter, was upregulated
during the transition from L0 to L1, and it was further
upregulated during the transition from L1 to G1. Since
excessive copper is toxic or even lethal to hepatocytes,
distinct expression of the ATOX1 gene alters intracellular copper
ion concentrations, thereby promoting DNA damage and cell injury.
In fact, a recent study showed the preventive effect of
copper-chelating agents on tumor development in the murine HCC
xenograft model (Yoshii, J., Yoshiji, H., Kuriyama, S., Ikenaka,
Y., Noguchi, R., Okuda, H., Tsujinoue, H., Nakatani, T.,
Kishida, H., Nakae, D., Gomez, D.E., De Lorenzo, M.S., Tejera,
A.M., and Fukui, H. The copper-chelating agent, trientine,
suppresses tumor development and angiogenesis in the murine
hepatocellular carcinoma cells. Int. J. Cancer. 94, 768-773
(2001)).

DNA damage and cell injury can be augmented by the
downregulation of an antioxidant gene CAT and
detoxification-related genes such as MT1H, MT1E, MT1F, MT1B,
MT3, and UGT2B7, promoting the dedifferentiation of HCC.

Using anti-hyaluronan receptor-1 antibody, Carreira et
al. showed that the number of lymphatic vessels was smaller in
HCC than in non-tumorous liver tissues such as liver cirrhosis
(Mouta Carreira, C., Nasser, S.M., di Tomaso, E., Padera, T.P.,
Boucher, Y., Tomarev, S.I., and Jain, R.K. LYVE-1 is not
restricted to the lymph vessels: expression in normal liver
blood sinusoids and down-regulation in human liver cancer and
cirrhosis. Cancer Res. 61, 8079-8084 (2001)). In the present
invention, expression of immune response-related genes such as
ORM1, C1R, C6, IL4R, C8B, and C1S was decreased during the
transition from L1 to G1, indicating that changes in
microenvironment in HCC occur during the transition from L1 to
G1. As reported previously, many genes encoding complement
components were downregulated during this transition (Okabe, H.,
Satoh, S., Kato, T., Kitahara, O., Yanagawa, R., Yamaoka, Y.,
Tsunoda, T., Furukawa, Y., and Nakamura, Y. Genome-wide
analysis of gene expression in human hepatocellular carcinomas
using cDNA microarray: identification of genes involved in
viral carcinogenesis and tumor progression. Cancer Res. 61,
2129-2137 (2001) and Iizuka, N., Oka, M., Yamada-Okabe, H.,
Mori, N., Tamesa, T., Okada, T., Takemoto, T., Tangoku, A.,
Hamada, K., Nakayama, H., Miyamoto, T., Uchimura, S., and
Hamamoto, Y. Comparison of gene expression profiles between
hepatitis B virus- and hepatitis C virus-infected
hepatocellular carcinoma by oligonucleotide microarray data
based on a supervised learning method. Cancer Res. 62,
3939-3944 (2002)).

Example 10. Genes whose expression changed during the
transition from well differentiated HCC (G1) to moderately
differentiated HCC (G2)

Genes whose expression was altered during the
transition from G1 to G2 include IFN-related genes, cell
structure and motility-related genes, transcription-related
genes, and tumor suppressor genes (Table 5).

During the transition from G1 to G2, the most prominent
genetic changes appeared to be downregulation of IFN-related
genes such as OAS2, STAT1, PSME1, ISGF3G, and PSMB9. Similar
genetic changes were also observed in prostate cancer cells
(Shou, J., Soriano, R., Hayward, S.W., Cunha, G.R., Williams,
P.M., and Gao, W.Q. Expression profiling of a human cell line
model of prostatic cancer reveals a direct involvement of
interferon signaling in prostate tumor progression, Proc. Natl.
Acad. Sci. U.S.A. 99, 2830-2835 (2002)). IFN acts not only as
an antiviral agent but also as an anticancer agent; however,
certain types of HCC do not respond to IFN. Downregulation of
the IFN-related genes can attenuate response of tumor cells to
IFN, suggesting that resistance of HCC to IFN is acquired
during the transition from G1 to G2. Among the IFN-related
genes, STAT1 appeared four times in our list of discriminators
of G1 from G2 (Table 5). Unlike other genes of the same family,
STAT1 functions as a tumor suppressor (Bromberg, J.F.
Activation of STAT proteins and growth control, Bioessays 23,
161-169 (2001)). Interestingly, IFN treatment increases the
expression of STAT1, as well as of many IFN-related genes, in hepatocytes
(Radaeva, S., Jaruga, B., Hong, F., Kim, W.H., Fan, S., Cai,
H., Strom, S., Liu, Y., El-Assal, O., and Gao, B.
Interferon-alpha activates multiple STAT signals and
down-regulates c-Met in primary human hepatocytes,
Gastroenterology 122, 1020-1034 (2002)). Upregulation of
STAT1 in HCC cell lines was observed during differentiation
induced by sodium butyrate (Hung, W.C. and Chuang, L.Y. Sodium
butyrate enhances STAT 1 expression in PLC/PRF/5 hepatoma cells
and augments their responsiveness to interferon-alpha, Br. J.
Cancer 80, 705-710 (1999)). The facts that STAT1 is a
transcriptional target of the IGF-independent apoptotic effect
of IGFBP3 (Spagnoli, A., Torello, M., Nagalla, S.R., Horton,
W.A., Pattee, P., Hwa, V., Chiarelli, F., Roberts, C.T. Jr.,
and Rosenfeld, R.G. Identification of STAT-1 as a molecular
target of IGFBP-3 in the process of chondrogenesis, J. Biol.
Chem. 277, 18860-18867 (2002)) and that IGFBP3 is downregulated
during the transition from L1 to G1 strongly suggest that
decreased expression of STAT1 during the transition from G1 to
G2 HCC facilitates the further dedifferentiation of HCC.

Transcription-related gene TRIM16 that is involved in
a variety of biological processes including cell growth,
differentiation, and pathogenesis, and tumor suppressor gene
TPD52L2 that promotes cell proliferation were also upregulated
during the transition from G1 to G2. Upregulation of these
genes in G2 HCC may promote growth and invasion of tumor cells.

Example 11. Genes whose expression changed during the 
transition from moderately differentiated HCC (G2) to poorly 
differentiated HCC (G3) 

Genes whose expression was altered during the
transition from G2 to G3 include proteolysis-related genes,
BCL2-related gene, and metabolism- and energy generation-related
genes (Table 6).

SPINT1 and LGALS9 turned out to be upregulated during
the transition from G2 to G3. SPINT1 is involved in regulation
of proteolytic activation of hepatocyte growth factor (HGF) in
injured tissues. Previously, Nagata et al. showed that
transduction of antisense SPINT1 (HAI-1) inhibited the growth
of human hepatoma cells, suggesting that SPINT1 plays an
important role in the progression of HCC (Nagata, K., Hirono,
S., Ido, A., Kataoka, H., Moriuchi, A., Shimomura, T., Hori,
T., Hayashi, K., Koono, M., Kitamura, N., and Tsubouchi, H.
Expression of hepatocyte growth factor activator and
hepatocyte growth factor activator inhibitor type 1 in human
hepatocellular carcinoma, Biochem. Biophys. Res. Commun. 289,
205-211 (2001)). LGALS9 belongs to a lectin family that is
involved in cell adhesion, cell growth regulation,
inflammation, immunomodulation, apoptosis, and metastasis.
Several galectins are thought to be related to cancer cell
adhesion (Ohannesian, D.W., Lotan, D., Thomas, P., Jessup, J.M.,
Fukuda, M., Gabius, H.J., and Lotan, R. Carcinoembryonic
antigen and other glycoconjugates act as ligands for galectin-3
in human colon carcinoma cells. Cancer Res. 55, 2191-2199
(1995)).

BNIP3, a BCL2-related gene, was downregulated during
the transition from G2 to G3. BNIP3 shares 56% amino acid
sequence identity with BNIP3L. As mentioned above, expression
of BNIP3L was decreased during the transition from L1 to G1.
Because BCL2 functions as an anti-apoptotic factor,
downregulation of BNIP3L and BNIP3 promotes oncogenesis,
facilitating the dedifferentiation of tumor cells.

Many metabolism- and energy generation-related genes
were also downregulated during this transition. In addition,
expression of PGRMC1 encoding a liver-rich protein that binds
to progesterone and RARRES2 was also decreased during the
transition from G2 to G3. Decreased expression of RARRES2 may
be the cause of the poor response of G3 HCC to retinoic acids.

Example 12. Color display of the expression of the selected 
genes in each transition stage 

Expression of 152 genes whose expression was
significantly altered during the transition from L0 to L1 (Fig.
1a), 191 genes whose expression was significantly altered
during the transition from L1 to G1 (Fig. 1b), 54 genes whose
expression was significantly altered during the transition
from G1 to G2 (Fig. 1c), and 40 genes whose expression was
significantly altered during the transition from G2 to G3 (Fig.
1d) was shown by color display. These genes clearly
distinguished the samples in the two consecutive
differentiation grades. Fig. 1e-h indicate the expression of
the selected 40 genes in each transition stage in all the
samples. Expression of the selected 40 genes whose expression
was significantly altered during the transition from L0 to L1
(Fig. 1e), from L1 to G1 (Fig. 1f), from G1 to G2 (Fig. 1g),
and from G2 to G3 (Fig. 1h) was also shown by color display.
The selected 40 genes in each transition stage discriminated
samples before and after the transition.

Example 13. Validation of the selected 40 genes in each
transition stage to distinguish the differentiation grade of
HCC

To validate discriminative performance of the selected
40 genes in each transition stage, the minimum distance
classifier with the selected 40 genes in each transition stage
was created. In each transition, the minimum distance
classifier was constructed with the samples in two consecutive
differentiation grades as indicated by the red bar (training
samples), and was applied to the samples in the remaining
differentiation grades as indicated by the black bar (test
samples) (Fig. 2). The resulting classifier classified the
test samples with the accuracy of 92% (Fig. 2a), 98% (Fig. 2b),
84% (Fig. 2c), and 100% (Fig. 2d).
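
A minimum distance classifier of the kind named in this example can be sketched as follows. This is an illustrative sketch under stated assumptions, not the inventors' code: it assumes that each class is represented by the mean of its training vectors and that Euclidean distance is used, neither of which is specified in the text. The function names and the list-of-lists data layout are hypothetical.

```python
import math


def class_means(samples, labels):
    """Average the training vectors of each class, component by component."""
    grouped = {}
    for x, y in zip(samples, labels):
        grouped.setdefault(y, []).append(x)
    return {y: [sum(col) / len(xs) for col in zip(*xs)]
            for y, xs in grouped.items()}


def classify(x, means):
    """Assign x to the class whose mean vector is nearest (Euclidean)."""
    return min(means, key=lambda y: math.dist(x, means[y]))


def accuracy(samples, labels, means):
    """Fraction of samples whose predicted class matches the given label."""
    correct = sum(classify(x, means) == y for x, y in zip(samples, labels))
    return correct / len(samples)
```

Under these assumptions, each sample would be a vector of the 40 selected expression values; `class_means` would be fitted on the two consecutive grades (the training samples) and `accuracy` evaluated on the samples of the remaining grades (the test samples).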


Example 14. Analysis by the self-organizing map (SOM) algorithm
of the genes whose expression changed during the transition
from non-cancerous liver (L0) to pre-cancerous liver (L1), from
pre-cancerous liver (L1) to well differentiated HCC (G1), from
well differentiated HCC (G1) to moderately differentiated HCC
(G2), and from moderately differentiated HCC (G2) to poorly
differentiated HCC (G3)

Expression of the genes whose expression differed with 
statistical significance between non-cancerous liver (L0) and 
pre-cancerous liver (L1), pre-cancerous liver (L1) and well 
differentiated HCC (G1), well differentiated HCC (G1) and 
moderately differentiated HCC (G2), and moderately 
differentiated HCC (G2) and poorly differentiated HCC (G3) was 
analysed in MATLAB R13 with the SOM toolbox available at the 
web site http://www.cis.hut.fi/projects/somtoolbox/ (Kohonen, 
2001). Forty genes from each of these four comparisons were 
used. The vectors of neighboring cells were located close to 
each other in the 155-dimensional gene space (Fig. 3a), where 
(m, n) indicates the cell located at the m-th row and n-th 
column, NL-XX indicates samples from non-cancerous liver 
without HCV infection (L0), IL-XX indicates samples from 
HCV-infected pre-cancerous liver (L1), G1-XXT indicates samples 
from well differentiated HCC (G1), G2-XXT indicates samples 
from moderately differentiated HCC (G2), and G3-XXT indicates 
samples from poorly differentiated HCC (G3). The map showed 
that the samples clearly formed a sigmoid curve in the order of 
L0, L1, G1, G2, and G3. G2 samples without vessel involvement, 
i.e., without venous invasion (blue letters), were located 
close to G1 samples, and G2 samples with vessel involvement, 
i.e., with venous invasion (red letters), were located close to 
G3 samples (Fig. 3a). Thus, the SOM classified G2 samples into 
two subtypes, i.e., tumor with venous invasion and tumor 
without venous invasion, along the course of dedifferentiation. 
When the distance between neighboring clusters was shown by 
colors, with red indicating a long distance, the red cells in 
the upper area clearly demonstrated that the non-tumorous 
(non-cancerous and pre-cancerous) liver samples and the HCC 
samples were relatively far apart in the 155-dimensional gene 
space (Fig. 3b). 
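
The analysis above was carried out with the SOM toolbox for MATLAB. For illustration only, the same idea can be sketched in Python, which is a substitution and not the toolbox actually used. The sketch assumes a rectangular grid of cells (m, n), a Gaussian neighborhood, and linearly decaying learning rate and radius; the function names, the parameter values, and the three-gene toy samples are hypothetical. The function `u_matrix` returns, for each cell, the mean distance to the vectors of its neighboring cells, which corresponds to the color-coded distance described for Fig. 3b.

```python
import math
import random


def best_cell(x, weights):
    """Return the (row, column) cell whose vector is nearest to x."""
    return min(weights, key=lambda cell: math.dist(x, weights[cell]))


def train_som(data, rows, cols, epochs=200, lr0=0.5, seed=0):
    """Train a rows x cols map; cells are indexed from (1, 1)."""
    rng = random.Random(seed)
    dim = len(data[0])
    sigma0 = max(rows, cols) / 2.0
    weights = {(m, n): [rng.random() for _ in range(dim)]
               for m in range(1, rows + 1) for n in range(1, cols + 1)}
    for t in range(epochs):
        frac = t / epochs
        lr = lr0 * (1.0 - frac)                      # learning rate decays
        sigma = sigma0 * (1.0 - frac) + 0.5 * frac   # radius shrinks to 0.5
        for x in rng.sample(data, len(data)):        # random presentation order
            bm, bn = best_cell(x, weights)
            for (m, n), w in weights.items():
                d2 = (m - bm) ** 2 + (n - bn) ** 2
                h = math.exp(-d2 / (2.0 * sigma ** 2))
                for i in range(dim):
                    w[i] += lr * h * (x[i] - w[i])
    return weights


def u_matrix(weights):
    """Mean distance from each cell's vector to its adjacent cells."""
    u = {}
    for (m, n), w in weights.items():
        nbrs = [c for c in ((m + 1, n), (m - 1, n), (m, n + 1), (m, n - 1))
                if c in weights]
        u[(m, n)] = sum(math.dist(w, weights[c]) for c in nbrs) / len(nbrs)
    return u


# Hypothetical three-gene profiles for four samples (illustration only).
toy = {
    "NL-a": [0.0, 0.1, 0.0],
    "NL-b": [0.1, 0.0, 0.1],
    "G3-a": [1.0, 0.9, 1.0],
    "G3-b": [0.9, 1.0, 0.9],
}
weights = train_som(list(toy.values()), rows=8, cols=5)
placement = {name: best_cell(vec, weights) for name, vec in toy.items()}
distances = u_matrix(weights)
```

Here `placement` plays the role of Table 1, listing the cell to which each sample is mapped, and large values in `distances` mark borders between groups of cells.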

Industrial Applicability 

Hepatocellular carcinoma (HCC) is one of the most common 
cancers worldwide. However, there is no therapy that can cure 
the disease. This is presumably due to sequential changes in 
characteristics of cancer cells during the development and 
progression of the disease. Particularly, progression of 
cancer is often associated with changes in the differentiation 
grade of tumor cells. Diagnosis and management of such changes 
of cancer cells will make cancer therapy more effective. In 
the present invention, genes whose expression correlates with 
oncogenesis and development of HCC are identified. A 
supervised learning method followed by a random permutation 
test is used to select genes whose expression significantly 
changes during the transition from non-cancerous liver without 
HCV infection (L0) to pre-cancerous liver with HCV infection 
(L1), from L1 to well differentiated HCC (G1), from G1 to 
moderately differentiated HCC (G2), and from G2 to poorly 
differentiated HCC (G3). The minimum distance classifier and 
the self-organizing map (SOM) with the selected 40 genes whose 
expression is significantly altered in each transition stage 
can correctly predict the differentiation grade of tumor 
tissues. Thus, these genes can be used for diagnosing the 
differentiation grade of HCC and for screening anti-cancer 
agents for the treatment of HCCs in each differentiation grade. 
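
The gene selection summarized above can be sketched as follows. The sketch assumes the common two-class form of the Fisher ratio, i.e., the squared difference of the class means divided by the sum of the class variances, and a label-shuffling permutation test; the exact formula and test settings of the present invention may differ. The function names, the number of permutations, and the toy values are hypothetical.

```python
import random
from statistics import mean, pvariance


def fisher_ratio(a, b):
    """Squared mean difference over summed variances (assumed form)."""
    diff2 = (mean(a) - mean(b)) ** 2
    denom = pvariance(a) + pvariance(b)
    if denom == 0:
        return 0.0 if diff2 == 0 else float("inf")
    return diff2 / denom


def top_genes(expr_a, expr_b, k=40):
    """Rank genes by Fisher ratio in descending order and keep the top k.

    expr_a[g] and expr_b[g] hold the expression values of gene g in the
    two groups being compared (e.g. G1 and G2).
    """
    scores = {g: fisher_ratio(expr_a[g], expr_b[g]) for g in expr_a}
    return sorted(scores, key=scores.get, reverse=True)[:k]


def permutation_p(a, b, n_perm=1000, seed=0):
    """Share of random relabelings whose ratio reaches the observed one."""
    rng = random.Random(seed)
    observed = fisher_ratio(a, b)
    pooled = list(a) + list(b)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(pooled)
        if fisher_ratio(pooled[:len(a)], pooled[len(a):]) >= observed:
            hits += 1
    return (hits + 1) / (n_perm + 1)


# Hypothetical expression values for two genes in two groups.
expr_a = {"gene1": [1.0, 1.1, 0.9], "gene2": [5.0, 1.0, 3.0]}
expr_b = {"gene1": [3.0, 3.1, 2.9], "gene2": [4.0, 2.0, 3.0]}
selected = top_genes(expr_a, expr_b, k=1)
p_gene1 = permutation_p(expr_a["gene1"], expr_b["gene1"])
```

With real data, `k` would be set to 40 and the selected genes would then be passed to the minimum distance classifier and to the SOM.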
Table 1. Clusters of samples profiled to L0, L1, G1, G2, and G3.

Cluster   Sample
(1,1)     IL-49, IL-58, IL-59, IL-60, IL-62
(1,2)
(1,3)     NL-64, NL-65, NL-68, NL-69
(1,4)     NL-56, NL-67
(1,5)
(2,1)
(2,2)     G2-34T
(2,3)
(2,4)
(2,5)     G2-16T, G2-29T, G2-45T
          G2-2T
(3,1)     G1-85T, G1-87T
(3,2)
(3,3)     G1-42T
          G2-22T
(3,4)
(3,5)
(4,1)     G1-86T
          G2-105T
(4,2)     G1-26T
(4,3)
(4,4)     G2-8T, G2-27T
(4,5)     G2-151T
(5,1)     G1-147T, G1-165T
(5,2)
(5,3)     [illegible]
(5,4)
(5,5)     [illegible]
(6,1)
(6,2)     G3-80T
(6,3)     G2-1T, G2-163T
          G2-161T
(6,4)     G2-28T, G2-155T
(6,5)     G2-90T
(7,1)     G3-107T
(7,2)     G3-25T
(7,3)     G2-46T, G2-62T, G2-171T
          G2-162T
(7,4)
(7,5)     G2-37T
          G2-6T, G2-58T
(8,1)     G3-35T, G3-81T, G3-174T
(8,2)     G2-49T
          G2-23T
          G2-12T
(8,3)     G2-10T
          G3-19T
(8,4)     G2-89T
(8,5)     G2-43T, G2-182T
Table 2. Clinicopathologic characteristics per study group.

Factors                        Well       Moderately  Poorly     P value
                               (G1)       (G2)        (G3)
Sex                                                              N.S.
  Male                         4          24          6
  Female                       3          11          2
Age (year)                     65.3±2.6   65.4±1.2    67.2±3.3   N.S.
Primary lesion                                                   N.S.
  Single tumor                 6          15          2
  Multiple tumors              1          20          6
Tumor size (cm)                2.0±0.3    5.0±0.5     6.0±2.5    p=0.0007 [illegible] (G1 vs G3)
Stage*                                                           p=0.066
  I                            6          10          2
  II                           1          17          3
  [label illegible]            0          8           3
Venous invasion*
  (-)                          7          21          3
  (+)                          0          14          5
Non-tumorous liver                                               N.S.
  Normal or chronic hepatitis  2          15          2
  Liver cirrhosis              5          20          6

*, Tumor differentiation, stage, and venous invasion were 
determined on the basis of TNM classification of UICC. 

Fisher's exact test, Student's t test, and Mann-Whitney's U 
test were used to elucidate the differences in backgrounds 
between each differentiation grade. 
N.S., not significant. 



WO 2004/090163 

PCT/JP2003/004458 



Table 3. Top-40 discriminatory genes in L0 and L1 [legible rows only]

Fisher ratio  GB number  Description                                             Symbol  Locus     Function
11.32         Z69043     signal sequence receptor, delta                         SSR4    X         [illegible]
11.17         AL080080   thioredoxin-related transmembrane protein               TXNDC   14q21.3   redox reaction
11.15         M63138     cathepsin D                                             CTSD    11p15.5   [illegible]
11.12         L09159     ras homolog gene family, member A                       ARHA    3p21.3    oncogenesis/actin cytoskeleton reorganization
10.99         AF017115   cytochrome c oxidase subunit IV isoform 1               COX4I1  [illeg.]  energy pathway
10.76         M13560     CD74 antigen                                            CD74    [illeg.]  immune response
10.22         M36035     [illegible] receptor                                    BZRP    22q13.31  flow of cholesterol into mitochondria
10.08         U47101     nitrogen fixation cluster-like                          NIFU    12q24.1   unknown
9.70          U81554     calcium/calmodulin-dependent protein kinase II gamma    CAMK2G  [illeg.]  signal transduction
9.59          M21186     cytochrome b-245, alpha polypeptide                     CYBA    [illeg.]  energy generation
9.47          D32129     major histocompatibility complex, class I, A            HLA-A   [illeg.]  immune response



Table 3. Top-40 discriminatory genes in L0 and L1 (cont'd) 
[Remaining rows and the following table pages are illegible in the source.] 



[Table continued; the gene rows on this page are illegible in the source. 
Legible entries of the Function column: low similarity to viral 
receptor; proton transport; signal transduction/calcium-binding 
protein; central nervous system development; copper homeostasis 
and ion transport; transcription.] 



Table 5. Top-40 discriminatory genes in G1 and G2 
[GB number column and part of the Locus column illegible in the source.] 

Fifteen genes downregulated in G2 in comparison with G1 

Fisher ratio  Description                                                  Symbol  Locus     Function
2.89          2'-5'-oligoadenylate synthetase 2                            OAS2    12q24.2   antiviral response protein/IFN-inducible
2.53          class I alcohol dehydrogenase alpha subunit                  ADH1A   [illeg.]  detoxification
2.51          hypothetical protein FLJ20378                                [none]  [illeg.]  unknown
2.43          signal transducer and activator of transcription 1           [illeg.] [illeg.] transcription/IFN-signaling pathway
2.12          heat shock transcription factor 2                            HSF2    [illeg.]  transcription
2.08          proteasome activator subunit 1                               PSME1   [illeg.]  proteolysis and peptidolysis/IFN-inducible
2.06          potassium inwardly-rectifying channel subfamily J, member 8  KCNJ8   12p11.23  potassium transport
2.02          proteasome activator subunit 1                               PSME1   14q11.2   proteolysis and peptidolysis/IFN-inducible
[illegible]   ubiquitin-conjugating enzyme E2L6                            UBE2L6  11q12     proteolysis and peptidolysis



Table 5. Top-40 discriminatory genes in G1 and G2 (cont'd) 
[GB number, Locus, and Function columns truncated or illegible in the source.] 

Fisher ratio  Description                                              Symbol
1.85          signal transducer and activator of transcription 1       STAT1
1.83          LIM protein                                              LIM
1.74          FLN29 gene product                                       FLN29
1.72          signal transducer and activator of transcription 1       STAT1
[illegible]   GABA(A) receptor-associated protein like 1               GABARAPL1
1.66          signal transducer and activator of transcription 1       STAT1

Twenty-five genes upregulated in G2 

Fisher ratio  Description                                              Symbol
4.41          ribophorin I                                             RPN1
3.25          zinc finger protein [truncated]                          ZNF161
2.83          eukaryotic protein synthesis initiation factor 4 gamma   EIF4G1
2.27          formin binding protein 3                                 FNBP3

Table 5. Top-40 discriminatory genes in G1 and G2 (cont'd) 
[Rows on the preceding page illegible in the source.] 

Fisher ratio  GB number  Description                               Symbol    Locus            Function
1.74          AB007963   KIAA0494 gene product                     KIAA0494  1pter-p22.1      unknown
1.74          U76247     seven in absentia homolog 1 (Drosophila)  SIAH1     [illegible]      proteolysis and peptidolysis/apoptosis
1.73          X68560     Sp3 transcription factor                  SP3       2q31             transcription
1.73          AB015344   ubiquilin 2                               UBQLN2    Xp11.23-p11.1    ubiquitination
1.73          AB018327   activity-dependent neuroprotector         ADNP      20q13.13-q13.2   unknown
[illegible]   AF004430   tumor protein D52-like 2                  TPD52L2   20q13.2-q13.3    cell proliferation
1.67          D14697     farnesyl diphosphate synthase             FDPS      [illegible]      cholesterol biosynthesis
1.67          AB028449   Dicer1, Dcr-1 homolog (Drosophila)        DICER1    14q32.2          RNA helicase



Table 6. Top-40 discriminatory genes in G2 and G3 
[Fisher ratio, GB number, Locus, and Function columns truncated or illegible in the source.] 

Thirty genes downregulated [remainder of heading illegible] 

Description                                                  Symbol
apolipoprotein C-I                                           APOC1
selenoprotein P, plasma, 1                                   SEPP1
PDZ domain containing guanine nucleotide exchange factor 1   PDZ-GEF1
sex comb on midleg-like 2 (Drosophila)                       SCML2
aldehyde dehydrogenase 5 family, member A1                   ALDH5A1
BCL2/adenovirus E1B 19kD interacting protein 3               BNIP3
F-box and WD-40 domain protein 1B                            FBXW1B
phenylalanine hydroxylase                                    PAH
Homo sapiens clone 24473 mRNA sequence                       [none]
paired basic amino acid cleaving system 4                    PACE4



Table 6. Top-40 discriminatory genes in G2 and G3 (cont'd) 

Fisher ratio  GB number  Description                                                  Symbol       Locus        Function
1.59          AI263099   FLJ31305 fis, clone LIVER1000104                             [none]       [illegible]  similar to Rattus norvegicus kidney-specific protein mRNA
1.57          U22961     unknown                                                      [none]       [illegible]  unknown
1.57          Z24725     mitogen inducible 2                                          MIG2         14q22.1      cell cycle control
1.53          U77594     retinoic acid receptor responder (tazarotene induced) 2      RARRES2      7q35         retinoic acid receptor/retinoic acid-inducible
1.49          L34081     bile acid Coenzyme A: amino acid N-acyltransferase           [illegible]  9q22.3       liver enzyme for glycine and bile acid metabolisms
1.49          M88458     KDEL endoplasmic reticulum protein retention receptor 2      KDELR2       7p22.2       intracellular protein traffic
1.48          U68723     checkpoint suppressor 1                                      CHES1        14q24.3-q31  transcription/cell cycle
1.48          X92098     coated vesicle membrane protein                              RNP24        12q24.31     intracellular protein traffic
1.44          D10040     fatty-acid-Coenzyme A ligase, long-chain 2                   FACL2        [illegible]  fatty acid metabolism
1.43          AB023194   KIAA0977 protein                                             KIAA0977     [illegible]  unknown
1.42          AF001903   L-3-hydroxyacyl-Coenzyme A dehydrogenase, short chain        HADHSC       [illegible]  mitochondrial enzyme/energy generation



Table 6. Top-40 discriminatory genes in G2 and G3 (cont'd) 
[Rows on the preceding page, and the Fisher ratio, GB number, and Locus columns on this page, are illegible in the source.] 

Description                                         Symbol   Function
serine protease inhibitor, Kunitz type 1            SPINT1   inhibitor specific for HGF activator
galectin 9                                          LGALS9   cell adhesion and metastasis
cytochrome b-245, alpha polypeptide                 CYBA     energy generation
bromodomain adjacent to zinc finger domain, 2A      BAZ2A    DNA-binding protein
palmitoyl-protein thioesterase 1                    PPT1     neuronal maturation
S100 calcium binding protein A13                    S100A13  extracellular stimuli, cellular responses
galectin 9                                          LGALS9   cell adhesion and metastasis
pim-2 oncogene                                      PIM2     cell proliferation
transforming growth factor, beta 1                  TGFB1    cell growth and adhesion






CLAIMS 

1. A method of defining the differentiation grade of tumor 
with genes and/or proteins selected by the statistical 
analyses based on the expression level or pattern of the 
genes and/or proteins of human tumor tissues obtainable 
from cancer patients. 

2. A method according to claim 1, wherein the human tissues 
are human liver tissues. 

3. A method according to claim 2, wherein the 
differentiation grade of tumor is selected from the group 
consisting of non-cancerous liver, pre-cancerous liver, 
well differentiated hepatocellular carcinoma (HCC), 
moderately differentiated HCC, and poorly 
differentiated HCC. 

4. A method according to claim 3, wherein the genes and/or 
proteins are differentially expressed between 
non-cancerous liver and pre-cancerous liver, 
pre-cancerous liver and well differentiated 
hepatocellular carcinoma (HCC), well differentiated HCC 
and moderately differentiated HCC, or moderately 
differentiated HCC and poorly differentiated HCC. 

5. A method according to any one of claims 1 to 4, wherein 
the expression level or pattern of genes and/or proteins 
is examined by means of DNA microarray, reverse 
transcription polymerase-chain reaction or protein 
array. 

6. A method according to claim 5, wherein the genes and/or 
proteins are selected in descending order of the Fisher 
ratio. 

7. A method according to claim 5 or 6, wherein the number 
of the genes and/or proteins is between 40 and 100. 

8. A method according to claim 5 or 6, wherein the number 
of the genes and/or proteins is between 35 and 45. 

9. A method according to claim 8, wherein the number of the 
genes and/or proteins is 40. 

10. A method of defining the differentiation grade of tumor, 
the method comprising steps of: 

(a) selecting genes and/or proteins that have the 
highest Fisher ratios in comparison between 
non-cancerous liver and pre-cancerous liver, 
pre-cancerous liver and well differentiated 
hepatocellular carcinoma (HCC), well 
differentiated HCC and moderately differentiated 
HCC, or moderately differentiated HCC and poorly 
differentiated HCC; and 

(b) defining the differentiation grade of tumor by 
using the genes and/or proteins. 

11. A method of defining the differentiation grade of tumor, 
the method comprising steps of: 

(a) determining the number of genes and/or proteins to 
define the differentiation grade of tumor; 

(b) selecting a number of genes and/or proteins decided 
in step (a) that have the highest Fisher ratios in 
comparison between non-cancerous liver and 
pre-cancerous liver, pre-cancerous liver and well 
differentiated hepatocellular carcinoma (HCC), 
well differentiated HCC and moderately 
differentiated HCC, or moderately differentiated 
HCC and poorly differentiated HCC; 

(c) applying the data of genes and/or proteins selected 
in step (b) to all samples; and 

(d) defining the differentiation grade of tumor. 

12. A method of defining the differentiation grade of tumor, 
the method comprising steps of: 

(a) determining the number of genes and/or proteins to 
define the differentiation grade of tumor; 

(b) selecting a number of genes and/or proteins decided 
in step (a) that have the highest Fisher ratios in 
comparison between non-cancerous liver and 
pre-cancerous liver, pre-cancerous liver and well 
differentiated hepatocellular carcinoma (HCC), 
well differentiated HCC and moderately 
differentiated HCC, or moderately differentiated 
HCC and poorly differentiated HCC; 

(c) applying the data of genes and/or proteins selected 
in step (b) to all samples; 

(d) designing a minimum distance classifier with the 
data of genes and/or proteins selected in step (b); 

(e) applying the minimum distance classifier designed 
in step (d) to all samples; 

(f) generating a self-organizing map with the data of all 
the genes and/or proteins selected in step (b); 

(g) applying the self-organizing map generated in step 
(f) to all samples; and 

(h) defining the differentiation grade of tumor. 

13. A kit for carrying out the method according to any one 
of claims 1 to 12, the kit comprising DNA chips, 
oligonucleotide chips, protein chips, probes or primers 
that are necessary for effecting DNA microarrays, 
oligonucleotide microarrays, protein arrays, northern 
blotting, RNase protection assays, western blotting, and 
reverse transcription polymerase-chain reaction to 
examine the expression of the genes and/or proteins 
selected by the statistical analyses in claims 1 to 12. 

14. Use of genes and/or proteins according to any one of 
claims 1 to 12 for screening anti-cancer agents. 

15. Use of antibodies specific to genes and/or proteins 
according to any one of claims 1 to 12 for treating tumors 
in different grades. 